Abstract

A set $A \subseteq [n] \cup \{0\}$ is said to be a 2-additive basis for $[n]$ if each
$j \in [n]$ can be written as $j = x + y$, $x, y \in A$, $x \le y$. If we pick
each integer in $[n] \cup \{0\}$ independently with probability $p = p_n \to 0$,
thus getting a random set $A$, what is the probability that we have
obtained a 2-additive basis? We address this question when the target
sum-set is $[(1-\alpha)n, (1+\alpha)n]$ (or equivalently $[\alpha n, (2-\alpha)n]$) for some
$0 < \alpha < 1$. Under either model, the Stein-Chen method of Poisson
approximation is used, in conjunction with Janson's inequalities, to
tease out a very sharp threshold for the emergence of a 2-additive
basis. Generalizations to $k$-additive bases are then given.
1 Introduction

In 1956, Erdős [3] answered a question posed in 1932 by Sidon by proving
that there exist an infinite sequence of natural numbers $S$ and constants
$C_1$ and $C_2$ such that for large $n$,

$$C_1 \log n < r_2(n) < C_2 \log n, \tag{1}$$

where, for $k \ge 2$, $r_k(n)$ is the number of ways of representing the integer
$n$ as the sum of $k$ elements from $S$, a so-called asymptotic basis of order $k$.
The result was generalized in the 1990 work of Erdős and Tetali [1], which
established that there exists an infinite sequence $S$ for which the analogue of (1)
holds for each fixed $k \ge 2$, i.e., for each large $n$,

$$r_k(n) = \Theta(\log n). \tag{2}$$

To achieve this result, Erdős and Tetali constructed a random sequence $S$ of
natural numbers by including $z$ in $S$ with probability

$$p(z) = C\,\frac{(\log z)^{1/k}}{z^{(k-1)/k}} \quad (z > z_0), \qquad p(z) = 0 \text{ otherwise},$$

where $C$ is a determined constant and $z_0$ is the smallest constant such that
$p(z_0) \le 1/2$. They then showed that this random sequence is a.s. an
asymptotic basis of order $k$ and that (2) holds a.s. for large $n$.

A natural finite variant of the above problem concerns the notion of
$k$-additive bases.

Definition 1. With $[n] := \{1, 2, \ldots, n\}$, a set $A \subseteq [n] \cup \{0\}$ is said to be a
$k$-additive basis for $[n]$ if each $j \in [n]$ can be written as $j = x_1 + x_2 + \cdots + x_k$,
$x_i \in A$, $i = 1, \ldots, k$.

Note that this definition allows for $x_i = x_j$, $i \ne j$. In [3], Erdős and Tetali
showed that in some probability space, almost all infinite sequences $S$ satisfy
(2) and are asymptotic bases of order $k$. It is natural to then ask, for finite
$A$, how small $A$ can be while still being a $k$-additive basis. For $k = 2$, if
$A$ is a 2-additive basis we must clearly have $\binom{|A|+1}{2} \ge n$, so that
$|A| \ge \sqrt{2n}\,(1 + o(1)) = 1.4142\,(1 + o(1))\sqrt{n}$. Extensive work, using predominantly
analytic techniques, has been done to improve this trivial lower bound to
$|A| \ge 1.428\sqrt{n}$ ([9]); $|A| \ge 1.445\sqrt{n}$ ([?]); $|A| \ge 1.459\sqrt{n}$ ([13]); and
$|A| \ge 1.463\sqrt{n}$ ([5]). The best upper bound appears to be $|A| \lesssim 1.871\sqrt{n}$ ([10]).

In this paper, we will use a probability model in which each integer in
$[n] \cup \{0\}$ is chosen to be in $A$ with equal (and low) probability $p = p_n$. We
will then give sharp bounds on the probability that the random set $A$ is a
$k$-additive basis. It is evident that small numbers must be present in an
additive basis, since, e.g., the only way to represent 1 in a 2-additive basis is
as $1 + 0$, and so in the random model edge effects come into play. Therefore,
the random ensemble is unlikely to form an additive basis unless we adopt
a different approach. This may be done in two ways, which lead to the
following alternative definitions:

Definition 2. A set $A \subseteq [n-1] \cup \{0\}$ is said to be a modular $k$-additive
basis for $[n]$ if each $j \in [n-1] \cup \{0\}$ can be written as $j = x_1 + x_2 + \cdots + x_k$
(mod $n$), $x_i \in A$, $i = 1, \ldots, k$.

Definition 3. A set $A \subseteq [n] \cup \{0\}$ is said to be a truncated $k$-additive basis
for $[n]$ if each $j \in [\alpha n, (k-\alpha)n]$ can be written as $j = x_1 + x_2 + \cdots + x_k$,
$x_i \in A$, $i = 1, \ldots, k$; $\alpha \in (0, 1)$.
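As an illustration of Definitions 2 and 3, the following minimal Python sketch checks by brute force whether a given finite set is a truncated or a modular $k$-additive basis. The function names (`k_sums`, `is_truncated_basis`, `is_modular_basis`) are hypothetical and chosen only for this example, and the enumeration is exponential in $k$, so it is meant for small cases only.

```python
import math
from itertools import combinations_with_replacement


def k_sums(A, k):
    """All sums x_1 + ... + x_k with x_i in A (repetition allowed)."""
    return {sum(c) for c in combinations_with_replacement(sorted(A), k)}


def is_truncated_basis(A, n, k, alpha):
    """Definition 3: every integer j in [alpha*n, (k - alpha)*n] is a k-sum of A."""
    lo = math.ceil(alpha * n)
    hi = math.floor((k - alpha) * n)
    sums = k_sums(A, k)
    return all(j in sums for j in range(lo, hi + 1))


def is_modular_basis(A, n, k):
    """Definition 2: every residue 0, ..., n-1 is a k-sum of A modulo n."""
    residues = {s % n for s in k_sums(A, k)}
    return residues == set(range(n))
```

For instance, with $n = 10$, $k = 2$ and $\alpha = 1/2$, the set $\{2, 3, \ldots, 8\}$ is a truncated 2-basis even though it contains neither 0 nor 1, which is exactly the edge effect that the truncation removes.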

It turns out that definitive results using Definition 2 have been proved in
the papers of Yadin [11] and Sándor [12], and we thus focus on developing
results using Definition 3. Although our model differs from that of Erdős
and Tetali (our set is constructed using a constant probability $p$ as opposed
to the $p(z)$ used in [3]), the output threshold probabilities for the size of a
truncated (or modular) $k$-additive basis end up being remarkably close to
the input probabilities used by Erdős and Tetali to construct their asymptotic
bases. We will stress the similarities between our results as appropriate. It
also bears mentioning that the threshold size for our random $A$ to be a
2-additive basis is, up to a logarithmic factor, similar to the bounds outlined
above using the analytic techniques.

Our work is organized as follows: We present threshold results on
truncated 2-additive bases in Section 2, using the Stein-Chen method of Poisson
approximation (see [2]), which is an alternative to the Janson inequalities
and Brun's sieve used in [11], [12]. This method allows one not just to find
the limiting probability that $A$ forms an additive basis, but also to
approximate the probability distribution $\mathcal{L}(X)$ of the random variable $X$, defined
as the number of integers $j$ that cannot be expressed as a sum of elements
in $A$. In the modular case, we will, accordingly, show as a corollary how our
results partially extend those in [13], [12]. In Section 4, we consider similar
questions for truncated $k$-additive bases; at times, the Janson exponential
inequalities ([1], [8]) are used to estimate some critical baseline quantities
that were calculated exactly in Section 2 for $k = 2$.

Remark 4. Throughout the rest of the paper, we suppress the descriptor
"additive", referring simply to "truncated $k$-bases" and, occasionally,
"modular $k$-bases".

2 2-Additive Bases

We begin by investigating truncated 2-bases in our random model outlined
above. We will then state the analogous result for modular 2-bases and
explain how the modular results (and their generalization to arbitrary
$k$-bases) extend the existing work in [13] and [12]. As with all results in this
paper, we seek to find the threshold value at which the bases emerge.

We begin by selecting each integer from $\{0, 1, \ldots, n\}$ independently with
probability $p = p_n$. We thus obtain a random set $A$, and denote by $X$ the
number of integers in $[\alpha n, (2-\alpha)n]$ that cannot be written in the form $x_1 + x_2$,
$x_i \in A$. Evidently, $A$ is a truncated 2-basis if and only if $X = 0$. We can
write $X = \sum_{j=\lceil \alpha n\rceil}^{\lfloor (2-\alpha)n\rfloor} I_j$, where $I_j$ equals one or zero according as the integer
$j$ cannot or can be represented as a 2-sum of elements in $A$. To simplify the
notation a bit, we will write simply $\sum_{j=\alpha n}$ in place of $\sum_{j=\lceil \alpha n\rceil}$ in the sequel.
The main results of this section are summarized in the following theorem:

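Theorem 5. Pick each integer in $[n] \cup \{0\}$ independently with probability
$p = p_n \to 0$, thus getting a random set $A$.

i. Let $Y \sim \mathrm{Po}(\lambda = E(X))$. Let $\delta > 0$; then for all

$$p \ge \sqrt{\frac{(1+\delta)\log n}{\alpha n}} \tag{3}$$

it holds that $d_{TV}(X, Y) \to 0$ $(n \to \infty)$.

ii. With

$$p = \sqrt{\frac{\frac{2}{\alpha}\log n - \frac{2}{\alpha}\log\log n + A_n}{n}}, \tag{4}$$

$A_n \to \infty \Rightarrow P(A \text{ is a truncated 2-basis}) \to 1$ $(n \to \infty)$;

$A_n \to -\infty \Rightarrow P(A \text{ is a truncated 2-basis}) \to 0$ $(n \to \infty)$;

and

$A_n \to A \in \mathbb{R} \Rightarrow P(A \text{ is a truncated 2-basis}) \to \exp\{-2\alpha e^{-\alpha A/2}\}$ $(n \to \infty)$.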
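The threshold behaviour in part ii can be explored numerically. The sketch below is a small Monte Carlo experiment, not part of the proof: it assumes the parametrization $p = \sqrt{((2/\alpha)\log n - (2/\alpha)\log\log n + A_n)/n}$ and the limit $\exp\{-2\alpha e^{-\alpha A/2}\}$, and all function names are hypothetical. Convergence in $n$ is slow (the corrections are of logarithmic order), so agreement with the limit at moderate $n$ should only be expected to be rough.

```python
import math
import random


def threshold_p(n, alpha, a_n):
    """p = sqrt(((2/alpha) log n - (2/alpha) log log n + a_n) / n), clipped to [0, 1]."""
    value = (2.0 / alpha) * (math.log(n) - math.log(math.log(n))) + a_n
    return min(1.0, math.sqrt(max(value, 0.0) / n))


def missing_count(n, p, alpha, rng):
    """Sample A from {0, ..., n} and count the targets in [alpha n, (2 - alpha) n] with no 2-sum."""
    A = [x for x in range(n + 1) if rng.random() < p]
    sums = {x + y for i, x in enumerate(A) for y in A[i:]}
    lo, hi = math.ceil(alpha * n), math.floor((2 - alpha) * n)
    return sum(1 for j in range(lo, hi + 1) if j not in sums)


def estimate_basis_probability(n, alpha, a_n, trials, seed=0):
    """Empirical frequency of the event X = 0 over independent samples of A."""
    rng = random.Random(seed)
    p = threshold_p(n, alpha, a_n)
    hits = sum(missing_count(n, p, alpha, rng) == 0 for _ in range(trials))
    return hits / trials


def limiting_probability(alpha, a):
    """The limit exp(-2 alpha exp(-alpha a / 2)) for A_n -> a."""
    return math.exp(-2.0 * alpha * math.exp(-alpha * a / 2.0))
```

A call such as `estimate_basis_probability(2000, 0.5, 0.0, 200)` can then be compared with `limiting_probability(0.5, 0.0)`, and varying the third argument shows the transition from probability near 0 to probability near 1.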

Proof. Here and throughout the paper, let $S$ denote the random sumset
generated by $A$. The first step in the proof will be to calculate the precise
asymptotics of $E(X)$.

Proposition 6. With $X$ as above,

$$E(X) = \sum_{j=\alpha n}^{(2-\alpha)n} P(I_j = 1) \sim 2\sum_{j=\alpha n}^{n} (1-p^2)^{j/2} \sim \frac{4}{p^2}\exp\{-np^2\alpha/2\}.$$

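Proof. This is a critical computation and needs to be justified for the entire
range of $p$'s that we encounter, in particular for all $p$ satisfying (3). The first
direction is easy:

$$2\sum_{j=\alpha n}^{n} (1-p^2)^{j/2} \le 2\sum_{j=\alpha n}^{\infty} (1-p^2)^{j/2} = \frac{2(1-p^2)^{\alpha n/2}}{1-\sqrt{1-p^2}} \le \frac{4}{p^2}(1-p^2)^{\alpha n/2} \le \frac{4}{p^2}\exp\{-np^2\alpha/2\},$$

using the inequalities $1 - x \le e^{-x}$ and $\sqrt{1-p^2} \le 1 - p^2/2$. For the other
direction, we have

$$2\sum_{j=\alpha n}^{n} (1-p^2)^{j/2} \ge \frac{2(1-p^2)^{\alpha n/2} - 2(1-p^2)^{n/2}}{1-\sqrt{1-p^2}} = \frac{2(1-p^2)^{\alpha n/2}\bigl(1 - (1-p^2)^{(1-\alpha)n/2}\bigr)}{\frac{p^2}{2} + \frac{p^4}{8} + O(p^6)} \sim \frac{4(1-p^2)^{\alpha n/2}}{p^2} \sim \frac{4}{p^2}\exp\{-np^2\alpha/2\},$$

where the last two steps use $np^2 \to \infty$ and $np^4 \to 0$, both of which hold near the threshold.

□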
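Proposition 6 lends itself to a direct numerical check, because for $k = 2$ the probability $P(I_j = 1)$ can be written exactly: the pairs $\{x, j - x\}$ with $x < j/2$ are disjoint, hence independent, and for even $j$ there is one extra representation $j/2 + j/2$. The sketch below (hypothetical function names, written for illustration only) computes the exact mean and the closed-form approximation $\frac{4}{p^2}\exp\{-np^2\alpha/2\}$ so that their ratio can be inspected.

```python
import math


def prob_missing(j, n, p):
    """Exact P(I_j = 1) for 0 <= j <= 2n when each of 0, ..., n is kept with probability p."""
    lo = max(0, j - n)                       # smallest admissible x in x + y = j with y <= n
    distinct_pairs = (j + 1) // 2 - lo       # pairs {x, j - x} with lo <= x < j / 2
    prob = (1.0 - p * p) ** distinct_pairs   # disjoint pairs are independent
    if j % 2 == 0:
        prob *= 1.0 - p                      # the single representation j/2 + j/2
    return prob


def expected_missing(n, p, alpha):
    """Exact E(X): sum of P(I_j = 1) over the truncated target range."""
    lo, hi = math.ceil(alpha * n), math.floor((2 - alpha) * n)
    return sum(prob_missing(j, n, p) for j in range(lo, hi + 1))


def asymptotic_mean(n, p, alpha):
    """The closed-form approximation (4 / p^2) exp(-n p^2 alpha / 2)."""
    return 4.0 / (p * p) * math.exp(-n * p * p * alpha / 2.0)
```

Evaluating `expected_missing(n, p, alpha) / asymptotic_mean(n, p, alpha)` for $p$ near the threshold gives a ratio close to 1, with a deficit of order $p$ coming from the parity factors that the asymptotic formula ignores.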



With $p$ defined as in (4) we see that $E(X) = (1 + o(1))\,(2\alpha\exp\{-\alpha A_n/2\})$.
As is often the case with threshold phenomena, Markov's inequality can be
used to easily establish the first part of ii, as we have $P(X \ge 1) \le E(X) \to 0$
provided that $A_n \to \infty$ in (4).

To establish the second and third statements in part ii of the theorem,
we go beyond estimating the point probability $P(X = 0)$, using instead the
Stein-Chen method of Poisson approximation [2] to establish a total variation
approximation for the distribution $\mathcal{L}(X)$ of $X$. If the distribution of $X$ is
approximately Poisson, then we will have $e^{-\lambda} - \varepsilon_n \le P(X = 0) \le e^{-\lambda} + \varepsilon_n$,
where $\lambda = E(X)$ is the mean of the approximating Poisson variable and
$\varepsilon_n$ is the total variation error bound for the approximation. We have that
$E(X) \to \infty$ if $A_n \to -\infty$; $E(X) \to 0$ if $A_n \to \infty$; and $E(X) \to 2\alpha e^{-\alpha A/2}$ if
$A_n \to A$. Thus, if we can show that $\varepsilon_n$ tends to zero in a window around
the threshold value of $p$ given by (4),
we will have that $P(X = 0) \to 0$ if $A_n \to -\infty$; $P(X = 0) \to 1$ if $A_n \to \infty$; and
$P(X = 0) \to \exp\{-2\alpha e^{-\alpha A/2}\}$ if $A_n \to A$.

With $Y \sim \mathrm{Po}(\lambda = E(X))$, and throughout writing $d_{TV}(A, B)$ instead of
the more appropriate $d_{TV}(\mathcal{L}(A), \mathcal{L}(B))$, we seek to bound

$$d_{TV}(X, Y) := \sup_{A \subseteq \mathbb{Z}^+} |P(X \in A) - P(Y \in A)|.$$



Following [2], we first need to determine, for each $j$ separately, an auxiliary
sequence of variables $J_{ji}$ defined on the same probability space with the
property that

$$\mathcal{L}(J_{j1}, J_{j2}, \ldots) = \mathcal{L}(I_1, I_2, \ldots \mid I_j = 1). \tag{5}$$

Our explicitly constructed coupling of the $J_{ji}$'s is as follows: If $I_j = 1$, set
$J_{ji} = I_i$ for all $i \ne j$. If $I_j = 0$, for all pairs $x_1, x_2 \in A$ such that $x_1 \ne x_2$ and
$x_1 + x_2 = j$, remove $x_1$ from $A$ with probability $\frac{p(1-p)}{1-p^2}$, remove $x_2$ from $A$
with probability $\frac{p(1-p)}{1-p^2}$, and remove both $x_1$ and $x_2$ from $A$ with probability
$\frac{(1-p)^2}{1-p^2}$. If $x \in A$ and $x + x = j$, then we remove $x$ from $A$ with probability
1. Finally, define $J_{ji} = 1$ if $i \notin S$ after the above coupling is implemented.
It is clear that (5) is satisfied since we have "de-selected" offending integers
based on the conditional probability of one or both integers in a pair being
absent, given that both are not present. The total variation bounds derived
in [2] are expressed in terms of the probability that the coupled indicator
variables are different after the coupling is implemented; i.e. $I_i = 1$, $J_{ji} = 0$
or $I_i = 0$, $J_{ji} = 1$. Now, $P(I_i = 1, J_{ji} = 0) = 0$, since if integer $i$ is not present
in $S$, it cannot magically appear after some integers have been de-selected.
The formula we need thus reduces to

$$d_{TV}(X, Y) \le \frac{1-e^{-\lambda}}{\lambda}\sum_j \Bigl(P^2(I_j = 1) + P(I_j = 1)\cdot\sum_{i \ne j} P(I_i = 0, J_{ji} = 1)\Bigr).$$

There are two terms above. The first,

$$\frac{1-e^{-\lambda}}{\lambda}\sum_j P^2(I_j = 1) \le \frac{1}{\lambda}\max_j P(I_j = 1)\sum_j P(I_j = 1) = \max_j P(I_j = 1),$$

and thus is bounded above by $P(I_{\lceil \alpha n\rceil} = 1) \le (1-p^2)^{\alpha n/2}$. This bound tends
to zero if $1/\sqrt{n} = o(p)$, a condition that is satisfied for $p$ satisfying (3).
To bound the second term in the sum, we begin by noting

$$\frac{1-e^{-\lambda}}{\lambda}\sum_j P(I_j = 1)\sum_{i \ne j} P(I_i = 0, J_{ji} = 1) \le \max_j \sum_{i \ne j} P(I_i = 0, J_{ji} = 1).$$

We further bound this second term by conditioning on the number of 2-sums
of $i$ that are present pre-coupling, denoting this number by $B_i$. Denote the
sumset of $A$ post-coupling by $S^*$. We see that

$$P(I_i = 0, J_{ji} = 1) = \sum_{k \ge 1} P(i \notin S^* \mid B_i = k)\cdot P(B_i = k).$$

k=l 

Note that if there exists x such that i = 2x, then P(i?i = k) = {^~j}~^)p'^''{^ — 
p2^r^l-*:-i(l — + {^~l'][^)p'^^~'^{l — p'^y^~^~''p, otherwise (when i is odd 
and f{i) := \^~\ to simphfy the notation) 



^ y\i - py''^-' + ^ y\i - py 

Our next step is to bound $P(i \notin S^* \mid B_i = k)$ via

$$P(i \notin S^* \mid B_i = k) \le \bigl(P(\text{a given 2-sum of } i \text{ removed} \mid B_i = k)\bigr)^k.$$

To bound the above term, we will assume all elements of the 2-sums of $i$
are part of 2-sums of $j$ and thus have positive probability of being removed
by the coupling process. Fix $x, y$, $x \ne y$, such that $x + y = i$. For ease of
notation, define $P_x$ to be the event that $x$ is part of a pre-coupling 2-sum of
$j$, and analogously define $P_y$. Further define $R_x$ to be the event that $x$ is removed
from $A$ by the coupling, and analogously define $R_y$. Consider the following
simple calculations:

P(P., P;, Rx) = p(l -p)(^l- ^ 

since the only undesirable outcome is if only the "other" component of the
2-sum of $j$ is removed in lieu of $x$;
and

p(p,, p„ i?,, Ry) =p'[i- < p'- 

In the case where x = y, the only case we would need to consider is 

From this, we see that 

P(a given 2- sum of i removed | Bi = k) < 2p + 2p^ + p^. 

Let C{j>) = 2p+2p^ +p'^ . Our above calculations yield that maxj ^(-^« 
0, Jj^ = 1) < Si + S2, where 

ri+ln 



Si := maxj^ ^X^' f ^^^'J - p'Y'^^-'-'il - p){C{p)y 



and 



i^j k=i 

Now, 



Si = max ^(1 - p) [(1 - / + r'^^^"^ - (1 - 

(l-p)max^(^ ^ + 



max 



< n . ^^^^C{p)p'e~-^'/' 
and hence $S_1 \to 0$ if $p$ satisfies (3). For $S_2$, we have

2 I /ri4-l-i 1 \ 




fc-1 



ij^j k=l 



-1 



xC(p)pmaxVe-r*/2lP' 



= o(Si) 



and we have that $d_{TV}(X, Y) \to 0$ if $p$ satisfies (3), and for $p$ in this range

$$\bigl|P(X = 0) - e^{-\lambda}\bigr| \to 0,$$

which finishes the proof. Note that if we consider a $p$ not in this range, the
result will hold by monotonicity. □

The modular 2-basis result is proved similarly, and suppressing all details
we state the result:

Theorem 7. Pick each integer in $[n-1] \cup \{0\}$ independently with probability
$p = p_n \to 0$, thus getting a random set $A$. Then,

i. Let $X$ denote the number of missing integers in the modulo-$n$ sumset
of $A$, and let $Y \sim \mathrm{Po}(\lambda = E(X))$. Let $\delta > 0$; then for all

$$p \ge \sqrt{\frac{(1+\delta)\log n}{n}}$$

it holds that $d_{TV}(X, Y) \to 0$.

ii. If we let

$$p = \sqrt{\frac{2\log n + A_n}{n}},$$

then

$A_n \to \infty \Rightarrow P(A \text{ is a modular 2-basis}) \to 1$ $(n \to \infty)$;
$A_n \to -\infty \Rightarrow P(A \text{ is a modular 2-basis}) \to 0$ $(n \to \infty)$;
$A_n \to A \in \mathbb{R} \Rightarrow P(A \text{ is a modular 2-basis}) \to \exp\{-e^{-A/2}\}$ $(n \to \infty)$.

The second part of Theorem 7 was proven in [13] using Janson's
correlation inequalities and in [12] using the method of Brun's sieve (see for example
[1]). In [12], the author is able to derive part i of Theorem 7 at the threshold
value of $p$, while the Stein-Chen method allows us to derive the result in a
window about the threshold, thus somewhat generalizing the previously known
results.

Remark 8. We have so far dealt with random sets of fixed expected size,
and now indicate briefly how we can easily transition to the case of random
sets of fixed size. This can be done for all the results in this paper, but
we indicate the method in the context of Theorem 5: Choose one of the $\binom{n+1}{|A|}$
sets of size $|A|$ at random, and suppose $|A| = \sqrt{Kn\log n}$; $K > \frac{2}{\alpha}$ (this corresponds
to the expected size of $A$ with $p = \sqrt{K\log n/n}$). Then we reconcile the two
models as follows with $p = \sqrt{K\log n/n}$:

$$P\bigl(A \text{ is not a basis} \,\big|\, |A| = \sqrt{Kn\log n}\bigr) \le P\bigl(A \text{ is not a basis} \,\big|\, |A| \le \sqrt{Kn\log n}\bigr)$$
$$\le \frac{P(A \text{ is not a basis})}{P\bigl(|A| \le \sqrt{Kn\log n}\bigr)} \sim 2P(A \text{ is not a basis}) \to 0,$$

by Theorem 5 and the central limit theorem. A similar argument holds if
$K < \frac{2}{\alpha}$, or even if $|A| = \sqrt{\frac{2}{\alpha}n\log n - \frac{2}{\alpha}n\log\log n + nA_n}$, where $|A_n| \to \infty$.
It follows that we can easily go back and forth from the independent model
to the fixed set size model except possibly when we are at the threshold, i.e.,
when

$$|A| = \sqrt{\tfrac{2}{\alpha}n\log n - \tfrac{2}{\alpha}n\log\log n + nA_n}$$

with $A_n \to A \in \mathbb{R}$.

3 Modular k-Additive Bases

Let us briefly now turn our attention to modular $k$-additive bases. We define
$A$ as in Section 2, and will choose integers to be in $A$ with probability $p = p_n$.





Define

$$S := \{x_1 + x_2 + \cdots + x_k \ (\mathrm{mod}\ n) \mid x_i \in A\}.$$

We define $I_j$ to be 0 if $j \in S$ and 1 if $j \notin S$. As before, let $X = \sum_{j=0}^{n-1} I_j$.
In [12] it was proven, again using the method of Brun's sieve, that

Theorem 9. With $X$ defined as above, if we choose elements of $[0, n-1]$ to
be in $A$ with probability

$$p = \sqrt[k]{\frac{k!\,\log n + A_n}{n^{k-1}}},$$

where $|A_n| = o(\log n)$, then

$$P(X = 0) \to \begin{cases} 0 & \text{if } A_n \to -\infty \\ 1 & \text{if } A_n \to \infty \\ \exp\{-e^{-A/k!}\} & \text{if } A_n \to A \in \mathbb{R}. \end{cases}$$

Also, if $Y \sim \mathrm{Po}(E(X))$,

$$d_{TV}(X, Y) \to 0 \text{ as } n \to \infty.$$

Using the Stein-Chen method we are able to reproduce the above result
and prove the total variation convergence for all $p \in [p_0, 1]$, where
$p_0 \ge \sqrt[k]{A\,k!\log n/n^{k-1}}$, with $A > (k-1)/k$. We will provide proofs for our contribution to
the window of convergence, omitting proofs when results were derived also
in [12].

With $X$ as above, we first derive (as in [12]) that

$$E(X) = n\cdot\exp\Bigl\{-\frac{n^{k-1}}{k!}p^k\bigl(1 + O(1/np)\bigr)\Bigr\}.$$

Following [2], we first need to determine, for each $j$ separately, an auxiliary
sequence of variables $J_{ji}$ defined on the same probability space with the
property that

$$\mathcal{L}(J_{j1}, J_{j2}, \ldots) = \mathcal{L}(I_1, I_2, \ldots \mid I_j = 1). \tag{6}$$

Such a coupling can probably be described explicitly as we did for $k = 2$ but
there is no need to do so: It is clear that the more integers in $A$, the higher the
probability that $I_j$ is 0, as integer $j$ is more likely to be representable as a
$k$-sum. Therefore, the $I_j$'s are decreasing functions of the baseline i.i.d. random
variables $\{Y_i\}$ that have distribution $P(Y_0 = 1) = p$, $P(Y_0 = 0) = 1 - p$,
and a monotone coupling satisfying (6) exists. Thus with $\lambda := E(X)$ and
$Y \sim \mathrm{Po}(\lambda)$, we can apply Theorem 2.E and Corollary 2.C.4 in [2] to get

$$d_{TV}(X, Y) \le \frac{1-e^{-\lambda}}{\lambda}\Bigl(V(X) - \lambda + 2\sum_{j=0}^{n-1} P^2(I_j = 1)\Bigr) \le \frac{V(X)}{\lambda} - 1 + 2P(I_0 = 1).$$

Now, from the asymptotic formula of $E(X)$, we have that
$2P(I_0 = 1) = 2\exp\bigl\{-\frac{n^{k-1}}{k!}p^k(1 + O(1/np))\bigr\}$, and this goes to 0 as $n \to \infty$ if

$$p \ge \sqrt[k]{\frac{\varphi(n)}{n^{k-1}}},$$

where $\varphi(n) \to \infty$ as $n \to \infty$ is arbitrary.

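We next need to show that $V(X)/\lambda = 1 + o(1)$. By symmetry,

$$V(X) = \sum_{j=0}^{n-1} V(I_j) + 2\sum_{i<j}\bigl(E(I_iI_j) - E(I_i)E(I_j)\bigr)$$
$$= \sum_{j=0}^{n-1} E(I_j) - \sum_{j=0}^{n-1} E^2(I_j) + 2\sum_{i<j} E(I_iI_j) - n(n-1)E(I_0)E(I_0)$$
$$= \lambda - \bigl(n + n(n-1)\bigr)E^2(I_0) + 2\sum_{i<j} E(I_iI_j). \tag{7}$$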

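Next, we need to bound the growth of $E(I_iI_j)$ for $i < j$. Now $E(I_iI_j)$ equals
the probability that neither $i$ nor $j$ is in the sumset, and thus $E(I_iI_j) = P(S = 0)$,
where, with a slight reuse of the letter $S$,

$$S := \sum_{r=i,j}\ \sum_{l=1}^{k}\ \sum_{\mathbf{a}=\{a_1,\ldots,a_l\}:\ \sum a_t = r} J_{\mathbf{a}},$$

and where $J_{\mathbf{a}} := J_{r,l,\mathbf{a}}$ equals one if the $l$ integers in $\mathbf{a}$ that sum to $r$ (with possible
repetition) are all selected to be in $A$. We have that (see for example Lemma
1.5 in [12])

$$E(S) = 2\,\frac{n^{k-1}}{k!}\,p^k\bigl(1 + O(1/np)\bigr).$$

Set $\{r, l, \mathbf{a}\} \sim \{s, m, \mathbf{b}\}$ if $\mathbf{a} \cap \mathbf{b} \ne \emptyset$; $\{r, l, \mathbf{a}\} \ne \{s, m, \mathbf{b}\}$. For the associated
dependency graph, we see that

$$\Delta = \sum_{\{r,l,\mathbf{a}\}}\ \sum_{\{s,m,\mathbf{b}\} \sim \{r,l,\mathbf{a}\}} E(J_{\mathbf{a}}J_{\mathbf{b}}) = O(n^{2k-3}p^{2k-1}) = O\bigl(n^{k-2}p^{k-1}E(S)\bigr),$$

and thus by Janson's inequality we have

$$P(S = 0) \le e^{-E(S)+\Delta} = \exp\Bigl\{-2\,\frac{n^{k-1}}{k!}\,p^k\bigl(1 + O(1/np) + O(n^{k-2}p^{k-1})\bigr)\Bigr\}.$$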



n 



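Returning to (7), we see that

$$\frac{V(X)}{\lambda} - 1 \le n\,\frac{\exp\bigl\{-2\frac{n^{k-1}}{k!}p^k(1 + O(1/np) + O(n^{k-2}p^{k-1}))\bigr\} - \exp\bigl\{-2\frac{n^{k-1}}{k!}p^k(1 + O(1/np))\bigr\}}{\exp\bigl\{-\frac{n^{k-1}}{k!}p^k(1 + O(1/np))\bigr\}}$$
$$= n\,\frac{\exp\bigl\{-2\frac{n^{k-1}}{k!}p^k(1 + O(1/np))\bigr\}}{\exp\bigl\{-\frac{n^{k-1}}{k!}p^k(1 + O(1/np))\bigr\}}\Bigl(e^{O(n^{2k-3}p^{2k-1})} - 1\Bigr).$$

Thus, the total variation distance goes to 0 if the right-hand side above tends to zero,
which holds if $p \ge \sqrt[k]{A\,k!\log n/n^{k-1}}$, $A > (k-1)/k$. Poisson approximation is thus valid
in a window that includes the threshold value of $p = \sqrt[k]{k!\log n/n^{k-1}}$.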

4 Truncated k-bases

The analysis for truncated bases is more subtle than that for modular bases,
since edge effects need to be handled carefully. Following the same setup as
before, we wish to represent each $j \in [\alpha n, (k-\alpha)n]$ as $j = x_1 + \cdots + x_k$ for
$x_i \in A$, where $x_1 \le x_2 \le \cdots \le x_k$. Again, for each $j$, let $I_j$ equal 1 if $j$
cannot be expressed as a $k$-sum of elements of $A$ and 0 otherwise. We set
$X := \sum_{j=\alpha n}^{(k-\alpha)n} I_j$ and note that $X = 0 \Leftrightarrow A$ is a truncated $k$-basis.

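Theorem 10. With $X$ defined as above, if we choose elements of $\{0\} \cup [n]$
to be in $A$ with probability

$$p = \sqrt[k]{\frac{K\log n - K\log\log n + A_n}{n^{k-1}}},$$

where $|A_n| = o(\log\log n)$ and $K = K_{\alpha,k} = \frac{k!\,(k-1)!}{\alpha^{k-1}}$; then

$$P(X = 0) \to \begin{cases} 0 & \text{if } A_n \to -\infty \\ 1 & \text{if } A_n \to \infty \\ \exp\bigl\{-\frac{2\alpha}{k-1}e^{-A/K}\bigr\} & \text{if } A_n \to A \in \mathbb{R}. \end{cases}$$

Also for

$$p \ge \sqrt[k]{\frac{\beta K\log n}{n^{k-1}}},$$

$\beta > (k-1)/k$, and $Y \sim \mathrm{Po}(E(X))$,

$$d_{TV}(X, Y) \to 0 \text{ as } n \to \infty.$$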

Before proving this theorem, we need some preliminary work. Let $S_j$ be
the set of all unordered $k$-tuples of nonnegative integers in $\{0\} \cup [n]$ that sum
to $j$.

Claim 11. For $j \in [\alpha n, n]$,

$$|S_j| = \frac{j^{k-1}}{(k-1)!\,k!} + O(j^{k-2}).$$

Proof. The number of ordered $k$-tuples of nonnegative integers that sum to
$j$ is $\binom{j+k-1}{k-1} \sim \frac{j^{k-1}}{(k-1)!}$. All such tuples will be composed entirely of numbers in
$\{0\} \cup [j]$, and at most $\binom{k}{2}\cdot n\cdot\binom{j+k-3}{k-3} = O(j^{k-2})$ of these contain a number
repeated once or more often. We can disregard these in the asymptotic
analysis, and consider the remaining unordered and ordered tuples. Each
remaining unordered tuple appears $k!$ times among the remaining ordered
tuples, giving us the desired first order asymptotics. □

Claim 12. If $1 \le j \le kn/2$, then $|S_j| = |S_{kn-j}|$.

Proof. There is a bijection between $k$-tuples in $S_j$ for $1 \le j \le kn/2$ and those
in $S_j$ for $j \ge kn/2$ given by $\{x_1, x_2, \ldots, x_k\} \mapsto \{n - x_1, n - x_2, \ldots, n - x_k\}$.
□

Lastly, we have:

Claim 13. For $j \in [n+1, (k-1)n]$,

$$|S_j| \ge \frac{n^{k-1}}{(k-1)!\,k!}\,(1 + o(1)).$$

Proof. We will use the fact that $|S_j|$, i.e. the number of partitions of $j$ of
size at most $k$ each part of which is less than or equal to $n$, is the coefficient
of $q^j$ in the $q$-binomial coefficient

$$\binom{n+k}{k}_q = \frac{(1-q)(1-q^2)\cdots(1-q^{n+k})}{(1-q)(1-q^2)\cdots(1-q^k)\,(1-q)(1-q^2)\cdots(1-q^n)}.$$

It is well known, see for example [11], that $\binom{n+k}{k}_q = \sum_{j=0}^{nk} a_jq^j$ is a polynomial
in $q$ and that the coefficients are unimodal, namely $a_{j-1} \le a_j$ for $j \le nk/2$.
Claim 11 yields that

$$a_n = \frac{n^{k-1}}{(k-1)!\,k!} + O(n^{k-2}),$$

and the proof now follows directly from Claim 12. □
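Claims 11 through 13 can be checked on small cases by computing $|S_j|$ exactly. The sketch below (the function name is hypothetical) counts multisets of $k$ elements of $\{0, \ldots, n\}$ with a given sum by a standard dynamic program over part sizes; these counts are the coefficients $a_j$ of the $q$-binomial coefficient in the proof of Claim 13.

```python
from math import comb, factorial


def multiset_sum_counts(n, k):
    """counts[j] = number of multisets of k elements of {0, ..., n} with sum j."""
    top = k * n
    # ways[c][s]: multisets of exactly c positive parts, each at most the
    # largest value processed so far, with total s. Zeros pad up to k elements.
    ways = [[0] * (top + 1) for _ in range(k + 1)]
    ways[0][0] = 1
    for v in range(1, n + 1):
        for c in range(1, k + 1):
            for s in range(v, top + 1):
                # ways[c - 1][s - v] already allows copies of v, so v may repeat.
                ways[c][s] += ways[c - 1][s - v]
    return [sum(ways[c][j] for c in range(k + 1)) for j in range(top + 1)]


def leading_term(j, k):
    """The first-order estimate j^(k-1) / ((k-1)! k!) from Claim 11."""
    return j ** (k - 1) / (factorial(k - 1) * factorial(k))
```

With `counts = multiset_sum_counts(n, k)`, the symmetry of Claim 12 reads `counts[j] == counts[k * n - j]`, the list is nondecreasing up to index `k * n // 2`, and `counts[n]` can be compared with `leading_term(n, k)`; the total `sum(counts)` equals `comb(n + k, k)`.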

We next begin our analysis of $E(X)$ with a preliminary claim.

Claim 14. With $S_j$ defined as above,

$$E(X) = \sum_{j=\alpha n}^{(k-\alpha)n} \exp\bigl(-|T_j|\,p^k(1 + o(1))\bigr),$$

where $T_j$, the set of $k$-tuples of distinct elements that add to $j$, satisfies

$$|T_j| \begin{cases} \sim j^{k-1}/\bigl((k-1)!\,k!\bigr) & \text{if } j \in [\alpha n, n] \\ \ge n^{k-1}/\bigl((k-1)!\,k!\bigr) & \text{if } j \in [n, (k-1)n] \\ \sim (kn-j)^{k-1}/\bigl((k-1)!\,k!\bigr) & \text{if } j \in [(k-1)n, (k-\alpha)n]. \end{cases}$$

Proof. Fix $j$, and let $B_{j,i}$ be the event that the $i$th unordered $k$-tuple
of elements of $\{0\} \cup [n]$ that add to $j$ is in $A$. Then by Janson's inequality,
we have

$$P(I_j = 1) = P\Bigl(\bigcap_i B_{j,i}^c\Bigr) \ge (1-p^2)^{O(j)}(1-p^3)^{O(j^2)}\cdots(1-p^{k-1})^{O(j^{k-2})}(1-p^k)^{|T_j|}$$
$$\ge \exp\Bigl(-\Bigl[\frac{|T_j|\,p^k}{1-p^k} + \frac{O(j^{k-2})\,p^{k-1}}{1-p^{k-1}} + \cdots + \frac{O(j)\,p^2}{1-p^2}\Bigr]\Bigr)$$
$$= \exp\Bigl(-|T_j|\,p^k\Bigl[\frac{1}{1-p^k} + \frac{O(j^{k-2})}{(1-p^{k-1})\,|T_j|\,p} + \cdots + \frac{O(j)}{(1-p^2)\,|T_j|\,p^{k-2}}\Bigr]\Bigr)$$
$$= \exp\{-|T_j|\,p^k(1 + o(1))\},$$

as for $x \in (0, 1)$, $1 - x \ge e^{-x/(1-x)}$, and with our choice of $p$, $np \to \infty$ and
for all $j$, we have $j = O(n)$.

For an upper bound, Janson's inequality yields

$$P(I_j = 1) \le \exp\bigl(-|T_j|\,p^k(1 + o(1)) + \Delta_j/2\bigr).$$

Now $\Delta_j = O(n^{2k-3}p^{2k-1})$ since the maximal contribution to $\Delta_j$ is when the
two tuples have distinct elements and intersect in a single element; as for all
values of $p$ in our window, $np \to \infty$. Thus

$$P(I_j = 1) \le \exp\bigl(-|T_j|\,p^k + O(n^{2k-3}p^{2k-1})\bigr) = \exp\bigl(-|T_j|\,p^k(1 + O(n^{k-2}p^{k-1}))\bigr).$$

With $p = o(n^{-(k-2)/(k-1)})$, a condition satisfied by all $p$'s in our theorem,
$n^{k-2}p^{k-1} \to 0$, finishing the proof. □

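We are now ready to begin the proof of Theorem 10.

Proof. We now have that (with $(1 + o(1))$ terms suppressed)

$$E(X) = 2\sum_{j=\alpha n}^{n} \exp\bigl\{-p^kj^{k-1}/((k-1)!\,k!)\bigr\} + \sum_{j=n}^{(k-1)n} \exp\{-p^k|T_j|\}.$$

The first summation, which we will call $\Sigma_1$ (correspondingly calling the
second sum $\Sigma_2$), can be bounded above as follows, recalling the definition of $K$
from the statement of the theorem, and setting $B = \alpha^{k-2}/\bigl(k!\,(k-2)!\bigr)$:

$$\Sigma_1 \le 2\sum_{j=\alpha n}^{\infty} \exp\bigl\{-p^kj^{k-1}/((k-1)!\,k!)\bigr\}$$
$$\le 2\exp\{-n^{k-1}p^k/K\}\sum_{i=0}^{\infty}\Bigl(\exp\bigl\{-(k-1)(n\alpha)^{k-2}p^k/((k-1)!\,k!)\bigr\}\Bigr)^i$$
$$= \frac{2\exp\{-n^{k-1}p^k/K\}}{1-\exp\{-Bn^{k-2}p^k\}}$$
$$\le \frac{2\exp\{-n^{k-1}p^k/K\}}{Bn^{k-2}p^k - B^2n^{2k-4}p^{2k}/2}$$
$$= (1 + o(1))\,\frac{2}{Bn^{k-2}p^k}\exp\{-n^{k-1}p^k/K\},$$

where we used the facts that $e^{-x} \le 1 - x + x^2/2$ in the fourth line and, in the
final line of the display, that for all choices of $p$ considered in the theorem,
$n^{k-2}p^k \to 0$. Finally, the geometric bound in the second line follows from the
fact that the ratio of consecutive terms in the sum satisfies

$$\exp\Bigl\{-\frac{p^k}{(k-1)!\,k!}\bigl[(j+1)^{k-1} - j^{k-1}\bigr]\Bigr\} \le \exp\Bigl\{-\frac{p^k}{(k-1)!\,k!}(k-1)j^{k-2}\Bigr\}.$$

For $\Sigma_2$ we have

$$\Sigma_2 = \sum_{j=n}^{(k-1)n} \exp\{-p^k|T_j|\} \le (k-2)\,n\exp\bigl\{-p^kn^{k-1}/((k-1)!\,k!)\bigr\} \to 0$$

if

$$p \ge \Bigl(\frac{A\log n}{n^{k-1}}\Bigr)^{1/k}; \quad A > (k-1)!\,k!.$$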

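A lower bound on $\Sigma_1$ (and thus on $E(X)$) is obtained by using elementary
integration by parts to derive a tight estimate for the integral

$$\int_t^{\infty} e^{-Cx^{k-1}}\,dx \quad \text{as } t \to \infty,$$

in much the same way that Gaussian tails are analyzed (which is the $k = 3$
case). We have

$$E(X) \ge 2\sum_{j=\alpha n}^{n} \exp\bigl\{-p^kj^{k-1}/((k-1)!\,k!)\bigr\} \sim 2\int_{\alpha n}^{n} \exp\bigl\{-p^kx^{k-1}/((k-1)!\,k!)\bigr\}\,dx$$
$$= 2\int_{\alpha n}^{\infty} \exp\bigl\{-p^kx^{k-1}/((k-1)!\,k!)\bigr\}\,dx - 2\int_{n}^{\infty} \exp\bigl\{-p^kx^{k-1}/((k-1)!\,k!)\bigr\}\,dx. \tag{8}$$

Setting, for $t > 0$, $\Psi(t, k) = \int_t^{\infty} \exp\{-Cx^{k-1}\}\,dx$, we see that

$$\Psi(t, k) \le \frac{1}{t^{k-2}}\int_t^{\infty} x^{k-2}\exp\{-Cx^{k-1}\}\,dx = \frac{\exp\{-Ct^{k-1}\}}{C(k-1)\,t^{k-2}}, \tag{9}$$

and, for another two constants $E, E' > 0$ (integrating by parts, one may take $E = (k-2)/(k-1)$ and $E' = (k-2)/(k-1)^2$),

$$\Psi(t, k) = \int_t^{\infty} \frac{x^{k-2}}{x^{k-2}}\exp\{-Cx^{k-1}\}\,dx = \frac{\exp\{-Ct^{k-1}\}}{C(k-1)\,t^{k-2}} - \frac{E}{C}\int_t^{\infty} \frac{\exp\{-Cx^{k-1}\}}{x^{k-1}}\,dx$$
$$\ge \frac{\exp\{-Ct^{k-1}\}}{C(k-1)\,t^{k-2}} - \frac{E'}{C^2t^{2k-3}}\exp\{-Ct^{k-1}\},$$

using (9). Thus, since in our context $C^2t^{2k-3} \gg Ct^{k-2}$,

$$\Sigma_1 \sim \frac{2\,(k-2)!\,k!}{\alpha^{k-2}n^{k-2}p^k}\exp\bigl\{-\alpha^{k-1}n^{k-1}p^k/((k-1)!\,k!)\bigr\}.$$

It is easy to verify that $\Sigma_2 = o(1)\,\Sigma_1$, and thus

$$E(X) \sim \Sigma_1 \sim \frac{2\,(k-2)!\,k!}{\alpha^{k-2}n^{k-2}p^k}\exp\bigl\{-\alpha^{k-1}n^{k-1}p^k/((k-1)!\,k!)\bigr\}.$$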
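The two-sided estimate for the tail integral $\Psi(t, k) = \int_t^\infty e^{-Cx^{k-1}}\,dx$ can be sanity-checked numerically. The sketch below is only an illustration under stated assumptions: the integral is approximated by Simpson's rule on a truncated interval, the upper bound is $e^{-Ct^{k-1}}/(C(k-1)t^{k-2})$, and the lower bound subtracts $\frac{k-2}{(k-1)^2C^2t^{2k-3}}e^{-Ct^{k-1}}$, a constant obtained here by one integration by parts rather than taken from the text. Function names are hypothetical.

```python
import math


def tail_integral(C, k, t, steps=20000):
    """Simpson approximation of the integral of exp(-C x^(k-1)) over [t, infinity), k >= 3."""
    # Truncate where the integrand has dropped by a further factor of about e^-40.
    upper = ((C * t ** (k - 1) + 40.0) / C) ** (1.0 / (k - 1))
    h = (upper - t) / steps  # steps must be even for Simpson's rule

    def f(x):
        return math.exp(-C * x ** (k - 1))

    total = f(t) + f(upper)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * f(t + i * h)
    return total * h / 3.0


def upper_bound(C, k, t):
    """exp(-C t^(k-1)) / (C (k-1) t^(k-2))."""
    return math.exp(-C * t ** (k - 1)) / (C * (k - 1) * t ** (k - 2))


def lower_bound(C, k, t):
    """Upper bound minus the integration-by-parts correction term."""
    correction = (k - 2) / ((k - 1) ** 2 * C * C * t ** (2 * k - 3))
    return upper_bound(C, k, t) - correction * math.exp(-C * t ** (k - 1))
```

For $k = 3$ the integral is a Gaussian tail, so `tail_integral(1.0, 3, 2.0)` can be compared with `math.sqrt(math.pi) / 2 * math.erfc(2.0)`; the relative gap between the two bounds is of order $1/(Ct^{k-1})$, which vanishes in the regime used in the proof.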
Let $\lambda = E(X)$ and let $Y \sim \mathrm{Po}(\lambda)$. First we note that $E(X)$ tends to
zero, infinity, or $\frac{2\alpha}{k-1}e^{-A/K}$ if $p$ is as stated in the theorem with $A_n \to \infty$,
$A_n \to -\infty$ or $A_n \to A$ respectively. To complete the proof of Theorem 10 we
use Poisson approximation to show that the total variation distance between
$X$ and $Y$ converges to 0, so that, in particular, $P(X = 0) \to e^{-\lambda}$.

Using Theorem 2.C of [2] with $\Gamma_\alpha^0 = \Gamma_\alpha^- = \emptyset$,

$$d_{TV}(X, Y) \le \frac{1-e^{-\lambda}}{\lambda}\Bigl(\sum_j P^2(I_j = 1) + \sum_j\sum_{\ell \ne j}\bigl(E(I_jI_\ell) - E(I_j)E(I_\ell)\bigr)\Bigr);$$

the above is just a variation of the bound used in the proof of Theorem 9.
Now

$$\frac{1-e^{-\lambda}}{\lambda}\sum_j P^2(I_j = 1) \le \max_j P(I_j = 1) \le \max_j \exp\{-|T_j|\,p^k(1 + o(1))\} \le e^{-\delta n^{k-1}p^k(1+o(1))},$$

where $\delta$ is a constant not depending on $p$ or $n$, and this term converges to
0 if $p = \sqrt[k]{G\log n/n^{k-1}}$ for any constant $G > 0$. For the double sum, we start by
using the estimates

$$P(I_r = 1) \ge \exp\{-|S_r|\,p^k(1 + O(1/np))\}$$

for $r = j, \ell$, and for the cross product term $E(I_jI_\ell)$ we use Janson's
inequality as in our Stein-Chen treatment of Theorem 9. For fixed $j, \ell$, note that
$P(I_jI_\ell = 1) = P(S = 0)$, where

$$S = \sum_{r=j,\ell}\ \sum_{s=1}^{k}\ \sum_{\mathbf{a}:=\{a_1,\ldots,a_s\}:\ \sum a_t = r} J_{\mathbf{a}},$$

where $J_{\mathbf{a}} := J_{r,s,\mathbf{a}}$ equals one if the $s$ integers in $\mathbf{a}$ that sum to $r$ (with possible
repetition) are all selected to be in $A$. We then use Janson's inequality and
the worst case scenario estimate for the $\Delta$ that arises to estimate

$$E(I_jI_\ell) \le \exp\bigl\{-(|S_j| + |S_\ell|)\,p^k(1 + O(1/np)) + O(n^{2k-3}p^{2k-1})\bigr\}$$

and then conclude that

$$\sum_j\sum_{\ell \ne j}\bigl(E(I_jI_\ell) - E(I_j)E(I_\ell)\bigr) \le \sum_j\sum_{\ell \ne j} \exp\bigl\{-(|S_j| + |S_\ell|)\,p^k(1 + O(1/np))\bigr\}\Bigl(e^{O(n^{2k-3}p^{2k-1})} - 1\Bigr),$$

so that

$$\frac{1}{\lambda}\sum_j\sum_{\ell \ne j}\bigl(E(I_jI_\ell) - E(I_j)E(I_\ell)\bigr) \le O(n^{2k-3}p^{2k-1})\,\frac{\lambda^2}{\lambda} \to 0$$

if $\lambda = o\bigl(1/(n^{2k-3}p^{2k-1})\bigr)$, i.e., if

$$\frac{2\,(k-2)!\,k!\,n^{k-1}p^{k-1}}{\alpha^{k-2}}\exp\bigl\{-\alpha^{k-1}n^{k-1}p^k/((k-1)!\,k!)\bigr\} \to 0,$$

i.e., if

$$p \ge \sqrt[k]{\frac{\beta K\log n}{n^{k-1}}},$$

where $\beta > (k-1)/k$, as claimed. □



Open Problems In both the modular and truncated cases, the representation
function question has yet to be addressed. We have preliminary results
in this direction and plan on publishing them in a future paper.